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Abstract 

The reading achfevement test performance of sixth*grade students in 
contrasting reading programs was examined* One program (Hatteson) is 
an explicitly sequenced^ behavioraiiy oriented skills program: the 
other (Scott Foresman) places more emphasis on rich language experience^ 
and less on specific skill development* An aptttude-treatment interaction 
was found: higher ability students in the Scott Foresman program 
outperformed those in the Hatteson program^ while lower ability students 
in the Hatteson program scored higher than those in Scott Foresman* 
Explanations and implications are discussed. 
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Achievement Outcomes of Two Reading Programs: 
An Instance of Aptitude-Treatment Interaction 

Recently^ several educational researchers have been attempting to 
examine a phenomenon that teachers have observed tor some time: the fact 
that different Instructional programs have quite different effects On 
students of differing aptitudes. Some time agO» Cronbach (l957) argued 
for the coalition of correlational and experimental psychology through 
the study of interactions between Instructional treatments and student 
aptitudes. Such Interactions have been teniied by Cronbach and others 
aptitude-treatment Interactions (ATI)* Cronbach and Snow (l977) have 
defined aptitude and treatment in very broad terns. Aptitude can refer 
to personality and strategy preferences as well as to ability measures; 
treatment covers any aspect of the Instructional environment^ Including 
such variables as pace and method of presentation^ as well as teacher 
characteristics * 

Cronbach and Snow's (l977) review of ATI research shows^ however^ 
that a combination of trivially brief trea^nts^ inadequate design and 
analysis techniques^ and general uncertainty about how to study the 
phenomenon have all led to many disappointingly doubtful and Inconsistent 
results. There were^ nonetheless » enough clear Instances of ATI for 
Cronbach and Snow to conclude that more careful and relevant work could 
yield significant findings for researchers^ curriculum designers^ and 
educational policy makers. To enhance future research^ Cronbach and 
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Sncw suggested three Issues for concentration: (a) the examination of 
the most plausible ATI hypotheses in long*duratio)t» real^school studies^ 
(b) the development of designs and analytic methods capable of handling 
such complex studies^ and (c) the development of a laboratory science 
for the analysis of aptitude tests and learning tasks» and the ATI con* 
structs based on them* 

The goal of the present study is to contribute to the first-mentioned 
effort^ that is» the examination of a plausible ATI in a real<^schoot 
settlng/as a result of long*term Instructional programs* The programs 
to t>e examined are two fourth-grade to sixth-grade reading programs* The 
aptitude construct Is represented by two distinct scores-^^an overall 
(verbal plus nonverbal) Intelligence test score* and a reading pretest 
score* given two years prior to the testing session* immediately prior 
to the students* entry Into the separate reading program. 

The ATI hypothesis to be considered may be stated in general terms 
cs follows: Instructional methods differ in the amount of information 
processing effort they demand of* or provide for* the learner* The greater 
the Information processing burden on the learner in a given program* the 
more the high-abillty student benefits* and the less the lcM-abi!Ity 
student gains* Conversely* the more the Instructional format relieves 
the learner of difficult processing* the smaller is the difference in . 
the performance of high*' versus low-ability students (Snow* |978)* This 
issue has often appeared In educational debate in varying terminology*- 
the optimal degree of structure* the benefits of discovery versus tradi- 
tional versus programmed methods* and so on* All these terms » Including 
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infonnation*processfng burden^ gloss over the fssue of precisely where 
the critical differences lie. They may reside largely Indifferences in 
the degree to which strategies are trained^ sequenced^ or accessed for 
particular tasks* or in the degree of expllcitness of expected responses^ 
or In the amount of clear signaling of relevant knowledge. 

While there has been little progress in analyzing how the above 
factors contribute tc ihe informatlon-processing burden^ there has been 
some agreement across a range of studies that this global instructional 
characteristic does affect students of different aptitudes differently. 
This effect has been shown to hold in the teaching of arithmetic procedures 
(Thiele^ 1938) and Introductory probability theory (Greeno & Mayer » Note l)< 
In both cases^ 'Waningftil" or discovery methods were found to enhance 
the performance of hfgh-abillty students^ while drill and rule*learning 
methods led to improved perfonnance among lower ability students. On 
a broader scale» Sharps (cited in Cronbach £ Snow^ 1977) examined differ- 
ences for fifth-grade students on a number of achievement measures arising 
from two long*t3rm programs: a conventional teaching program and an 
Individually Prescribed Interaction (IPI) program* The IPI program 
entailed pretests^ specified objectives and sequences leading to these 
objectives^ with frequent mini-tests for feedback and branching purposes. 
The results showed a strong interaction between verbal ability plus 
achievement and the two Instructional programs. The general ATI 
hypothesis was confirmed: for the lower ability students^ the IPI method 
was more beneficial; for the higher ability students^ IPI was less 
effective than the traditional program. 
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As Cronbach and Snow ()977) emphasized^ the area of reading has 
received a lot of attention from researchers^ but most comparisons of 
reading programs have not been designed to examine^ 1ong*^term ATI effects^ 
There are few clear findings^ Bond and Dykstra (1967) reported results 
from the massive Co-operatfve Reading Program study of first-grade reading 
programs. They reported no discernible ATI effects.. Some questions about 
sample groupings and data analysis^ however^ led Cronbach and f tow 
and Lo (cited in Cronbach t Snow» 1977) to reanalyze portions of the data. 
In these reanalyses^ some AT) effects were found. Language-experience 
programs, for instance, led to more striking differences between high- 
and low-ability girls than dtd whole-word approaches. The fact that this 
relation did not hold for boys Illustrates Cronbach and Snow's conclusion 
that, while several results were suggestive of ATJ, the effects were by 
no means convincing. 

The aim of this study is to examine the possibility of ATI, at a 
broad level, in two reading programs. Current programs differ in the 
degree to which they explicitly sequence and structure tasks. Some 
programs explicitly train, sequence, and maintain skills and knowledge 
such as letter-sound correspondence, new word meanings, disambiguation 
skills, cross- sentence Integration, and Inferencing. We might hypothesize, 
then, that in the comparison of two long-term reading programs, lower 
ability students will show better performance in the program which pro* 
vides more explicit and detailed direction than they wi II in iess directive 
programs. Higher ability students will show the reverse trend, if the 
hypothesis holds. 
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Method 

Subjects 

The students tested comprised all those sixth-graders ( N = 180) in 
a large school district In the greater Chicago area who had been In their 
present reading program for two years or longer* The area Is^ for the 
most part^ mlddle-class residential. There were 96 students In the 
Matteson program and 84 In the Scott-Foresman program* 

The Reading Programs 

The Matteson Four-DImensIonal Reading Project* The Matteson Program 
Is a behavlorally oriented skills program* As stated In the Teacher^s 
Guide (Schlenker^ 1978) the program Includes: (a) a list of behavioral 
objectives for grades three through eighty (b) Individualized learning 
packages covering 108 behavioral objectives In the major skill areas 
(word recognition* vocabulary developntent « literal comprehension* inter- 
pretive comprehension* analytical conprehenslont study skills^ and compo* 
s]tIon)t (c) criterion-referenced pretests* posttests* and subsklll tests 
for each level of the program* (d) individual pupil progress charts and 
classroom progress charts* and (e) commercial materials to be used wt th 
learning cards * 

The program entails branching and repetitions when test scores 

dictate such* The behavioral objectives are often stated In quantitative 

terms* Some examples are: 

1* Word discrimination: Given a list of 44 Dolch sight words* 
the student ^vtll pronounce correctly the words* 
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2. (A7) Analogy: Given a pair of words that are analogous^ the 
student will identify correctly another pair of words that are 
related in the same way* 

The Hatteson program ts strictly sequenced and closely monitored* 
In Snow's (1977) terms» the program seems to be one which» relative to 
other programs^ relieves the learner of much difficult and unspecified 
processing* In theory at leasts subskllls are mastered before dependent 
higher-order skills are developed* 

The Scott Foresman Reading System* While the Hatteson program 

includes in its overview some general suggestions for free reading and 

supplemental language experiences^ the Scott Foresman program has as its 

central goal the student's interaction with the concepts and ideas put: 

forth tyy an author* The emphasis Is on rich language experience and 

frequent discussions aimed at the literal and inferential significance 

of discourse* A paragraph from the introduction to the Teacher's Guide 

to Level 13 ( Windows 1972) gives the flavor of the place of specifrc 

skills in this program: 

For the few pupils who may need detailed guidance as they read» 
suggestions are given within particular selections* Other 
guidance points up a specific skill or gives help with concepts 
and words or phrases that may be unfamiliar* The latter might 
be used with pupils who read independently as well as those who 
read under guidance* (p* 1) 

After each passage^ students complete test questions which relate to 

vocabulary and literal and inferential comprehension^ but the program it* 

self contains no strict sequencing or criteria for progress* 
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Two points need to be made here. Firsts as already stated^ both 
programs entail both skill butld{r>9 and rich language experience. It is 
at the level of the central goals of the programs that clearly differing 
emphases can be detected. These differences relate to sequencing^ feed- 
back to student and teacher^ and degree of specificity of the skill and 
the expected performance. It is these aspects whlch» tt ts hypothes ized» 
characterize the difference In processing burden on the learner. Second, 
the teacher's guides are not a direct measure of what actually goes on 
in the classrooms when these programs are used. It is possible^ though 
unlikely^ that some teachers use the Scott Foresman program in a strictly 
sequenced and monitored fashion^ making very explicit the goals in each 
section. Similarly^ the Natteson program could be used merely as a 
remedial adjunct to an experientially oriented program. These modifica- 
tions would» however^ be contrary to the spirit and the letter of the 
guides. In addition^ we were assured that the teachers in this district 
used the programs as the guides indicate. 

The Ability and Heading Tests 

The students' verbal and non*verbal abilities were measured with the 
elementary levels form R of the Educational Development Series (Sciiolastic 
Testing Service^ 1976)- Form R inctudec* non-verbal (figure matching) and 
verbal (word knowledge and use) measures of ability. 

The reading tests given were the appropriaM level reading subscales 
of the Educational Development Series. The pretest used was that 
administered to the students at the end of grade four. The posttest was 
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the appropriate form administered at the end of grade six to the same 
180 students. The reading tests entailed literal^ inferential » and main 
idea comprehension questions^ tn multiple choice format^ following brief 
passages and poems. The passages consisted of narrative and expository 
discourse. In the sixth^grade test» for instance^ there were five passages 
(two narrative^ three expository) and one poem and 50 multiple choice 
questions in all. 

Results 

Before presenting the major findings concerning the effects of these 
two programs^ the initial comparability of the two groups needs to be 
established^ since^ for some unexplained reasons^ the interaction between 
aptitude and program might have existed at the point of entry to the pro*^ 
gram. Any interactions between aptitude and program on posttests would» 
in this case» be artifactual. 

Means and standard deviations for the two groups are presented in 
Table 1. The only contrasting point is the slightly larger standard 

Insert Table I about here. 

deviation on ability scores among the Scott^Foresman students. It is also 
important to establish the similarity of correlations among the measures 
for the two groups. Tables 2 and 3 show that» with two exceptions^ these 
correlations do not differ significantly. The two exceptions are the 

Insert Tables 2 and 3 about here. 



ERLC 



Achievement Outcomes 
10 

correlations between pretest and number of years in the programs (z = 3.539)* 
and the correlations between posttest and ability (z = 2J97)- ''We have 
no neat explanation of the First difference. The second, however, strongly 
suggests an interaction between aptitude and programs on posttest scores. 

Since there was a sKght difference in ability variances for the two 
groups, the regression slopes for ability on pretest scores were computed. 
For the Hatteson group, the slope was .57 and» for the Scott-Foresman group, 
it was .54. This difference was Judged to be trivial. Any interaction 
between ability and program on posttest scores could not be attributable 
to an interaction existing prior to the instructional treatment. 

Tables 2 and 3 show that sex did not correlate highly with any other 
measures, tn a regression analysis* sex was found not to be a significant 
predictor. It did not interact significantly with course or ability, 
nor were there any significantly different correlations between the 
measures when the coefficients were computed separately for boys and 
gtrls. Thus sex was dropped from the remaining analyses. 

The aptitude-program interaction was confirmed using multiple regres- 
sion analysis. Posttest scores were predicted for each reading group 
separately from the equations presented in Tables 4 and 5- tt can be 

Insert Tables 4 and 5 about here. 

seen that* for the Hatteson students* posttest is predicted by both 

pretest and ability. For the Scott-Foresman group* on the other 

hand* the only significant predictor of posttest performance is ability. 
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A sfgnfficant aptitude-^treatment interaction was found and is presented 
in Tables 6 and 1. From Table 6 it can be seen that ability, pretest, and 
the course by ability interaction contributed signif icently to the predic- 
tion of posttest scores. In total, over 50% of the variance on the posttest 

Insert Tables 6 and 7 about here. 

was predicted. Of particular interest here is the course by ability inter* 
action. Table 7 indicates this effect. Mid^polnts along both the ability 
and pretest continua were selected, and the predicted posttest scores for 
Matteson (H) and Scott Foresman (SF) students are presented for purposes 
of illustration. These predicted scores were computed using the equations 
presented in Tables k and 5. It Is the case that lower^abl I i ty students who 
perform as well as or better than their ability scores would predict on 
the pretest, perform better In the Matteson program. It is equally 
clear that higher^abi I Ity students who perform as well as or worse than 
their ability scores would predict, perform better in the Scott Foresman 
program. Interestingly, the medlum'^abil I ty student who performs poorly 
on the fourth*grade pretest is predicted to do better in the Scott 
Foresman program, while the medium-ability student who performs particularly 
well on the pretest is predicted to do better In the Matteson program. 
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Dtsciission 

The hypothesis of Interaction between ability and instructional 
condition is confirmed. Lower ability students benefited from the 
Hatteson program* higher ability students from the Scott Foresman program. 
We infer that this finding represents an affirmation of the general 
hypothesis that a program ^ich relieves the learner of much of the burden 
of processing, by assuming the acquisition* maintenance^ and checking of 
irT4>ortant subskllls* wilt be of greater benefit to lower ability students 
than will a less strictly organized program emphasizing higher-order 
skills and more conceptual Interaction with reading materials. Higher 
ability students* on the other hand* will benefit more from the latter 
Program. Thus* as Snow (1977) suggested, a highly structured program 
tends to "flatten out" the relationship between performance and ability. 

Some speculation about why this occurs is called for. An explana* 
tion may tie in the construct of 'Vnetacognltion" that has been developed 
in recent theory (see Flavetl, 1976}- It has been suggested that ability 
test performance does not relate only to the amount of information possessed 
by a student but also to the student's ability to organize knowledge and 
ski 1 1s. Sens} t ivi ty to one's errors while reading* knowing when to use 
certain strategies to enhance recall^ locating gaps In knowledge while 
studying, and soon» are all executive processes rather than knowledge 
about reading per se. 

If a large portion of the differences between high- and Iow*abIlity 
students is thought to entail deficits In executive functions* then a 
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program that takes on itself responsfbfU ty for many of these functions 
vfould lessen the differences between students of different abilities. 
The 1ow*abl1lty students would be helped, and It may be that higher 
ability students, who have developed their own ways of organ!z!r>9 their 
readir>9 and learning, find the explicit and ubiquitous organizational 
aspects of the highly structured program an interference* A program 
vrftlch takes such executive skills for granted and concentrates on language 
enrichment enhances the learning of the high-abltlty students and deprives 
the lower ability students of the organizational supports they need. 

Some predictable qualifications need to be made to these Interpre* 
tatlons. First, while the sample Is large, there may be something 
distinctive about this district or these students which would detract 
from the rept Icabili ty of such a result. Second, the notion of aptitude 
needs closer examination. There may be certain skills not assessed by 
the pretests which render the samples different In an Important my 
at their time of entrance Into these instructional programs. Third, 
to reiterate an earlier point, we need to know more precisely how 
strictly the program specifications were adhered to. There may 
have been some mixing of techniques along an '*experlence*based/skil1* 
based" continuum. These qualifications amount to suggestions for further 
research to clarify and detail the relationships found here. 

These findings have iinpl Ications for reading educators: Both types 
of programs should be available to a student, and initial placement should 
depend upon entering ability. The obvious danger of such a policy is 
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that thfs placement can become Incarceration. Thfs can be avoided by 
careful and frequent monitoring of riding performance and flexibility In 
the movement of students between programs. 

Researchers and evaluators of reading Instruction wMI also find 
Implications In these results. Huch research on reading Instruction is 
of the horse-race sort (see» for instance^ Bowers.^ l974» and the review 
by Samuels S Schachter» 1978)- The overall means of two programs are 
compared. These results strongly suggest the soundness of Cronbach and 
Snow's (1977) advice to regard separarely the performance of students of 
different aptitudes. 
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Table I 

Means and Standard Deviations 



Hatteson Group « 96 Scott-Foresman Group » 8^ 
H SD^ H SD^ 

Ability I1ii.6l \tiM n3.7^ 15<27 

Pretest 23.27 13.15 2^1.92 12.99 

Posttest 30.53 iO.Ol 31.02 JO. 21 
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Table 2 

Correlations for the Matteson Group 



12 3 ^ 5 

Y.t.P. Sex Ability Posttest Pretest 

Years in Program 

Sex .0^1 

Ability .10 -.03 

Posttest .10 .0^1 .58 

Pretest .20 -.02 .62 .6) 
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Table 3 

Correlations for the Scott-Foresman Group 



12 3 k 5 

Y.t.P. Sex Ability Posttest Pretest 

Years in Program 

Sex .00 

Ability .33 .18 

Posttest .I.8* .20 .76 

Pretest .63 .13 .64 .50 
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Table 

Regression Weights, Partitioning of Variance, 
and Significance Tests for Posttest 
(Matteson, N =« 96) 



Variable 


B Weight 


% Variance 


F 


Ability 


.35 


3^1.08 


55.81* 


Pretest {gr. ^) 


.31 


10.13 


16.59" 


Years in Program 


5.62 


. .02 


< 1 


Y.I.P. X Ability 


-.05 


.13 


< 1 


Intercept 


-16.39 






Residual 




55.56 




R » .666; F(^l,91) 


= 18. U; £ < 


.01 
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Table 5 

Regression Weights, Partittonmg of VarUnce, 
and Signff icance Tests for Posttest 
(Scott-Foresman, N » 84) 



- Varrabte 


B Weight 


% Variance 




Abt U ty 


.28 


57.16 


108.57" 


Pretest (gr. k) 


.08 


.03 


< I 


Years In Program 




1.08 


2.05 


* 

y.l.P. X Ability 


.07 


.14 


< I 


Intercept 


5.77 






Residual 




41.59 




ft - .764; F(AJ9) 


- 27.74; £< 


.01 





E< .01 
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T^le 6 

Regression Weights^ Partf tinning of Variance^ 
and Significance Tests for Pdsttest (N =» 180) 



Variable 


S Weight 


% Variance 


F ■ 


AbrUty 


.23 


4^1.36 


155.33" 


Pretest (gr. 4) 


.32 


3.29 


ft 

n.52 


Years In Program 


-t,05 


.12 


< 1 


Course 


-22.48 


.22 


< t 


Course x Pretest 


-.28 


.33 


).)6 


Course x Abf Hty 


.27 


2.27 


7.95" 


Intercept 


-.47 






Residua) 




49.4) 




R = .711; F(6» 


173) » 29.53; £< 


.0) 





"p < .01 
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Table 7 

Predicted Means for Students at Various Combinations 
of Pretest and Ability in the Two Reading Progranis 



Pretest 


Verbal and fton-Verbal AbUtty Combined 


Grade 4 


90 


no 


130 


12 


22.00^(M)^ 
19.39 (SF)*" 


26.44(M) 
28.80(SF) 


This combination not 
1 tkely to occur. 


24 


25.74 (M) 
20.38 (SF) 


30.I8(M) 
29.80(SF) 


34.62(M) 
,39.22 (SF) 


36 


Thfs conibfnation not 
1 Ikely to occur. 


33.92(M) 
30.80(SF) 


38.36(M) 
40.21 (SF) 


Note. 


Figures are based on 
either program: 2.6 


th'e student wl th 
years. 


the average stay in 



^ Standard error of estimate (Sest) v/as 7.^7 for the Hatteson 
group and 6,59 for the Scott*Foresman group* 

b 

Hatteson 
^ Scott-Foresman 
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